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Abstract 

In order to investigate the routing aspects of small-world networks, Klein- 
berg [13] proposes a network model based on a d-dimensional lattice with long- 
range links chosen at random according to the d-harmonic distribution. Kleinberg 
shows that the greedy routing algorithm by using only local information performs 
in 0(lg^ n) expected number of hops, where n denotes the number of nodes in 
the network. Martel and Nguyen [17] have found that the expected diameter of 
Kleinberg's small-world networks is 0(lgn). Thus a question arises naturally: Can 
we improve the routing algorithms to match the diameter of the networks while 
keeping the amount of information stored on each node as small as possible? 

Existing approaches for improving the routing performance in the small-world 
networks include: (1) Increasing the number of long-range links [2, 15]; (2) Ex- 
ploring more nodes before making routing decisions [14]; (3) Increasing the local 
awareness for each node [10, 17]. However, all these approaches can only achieve 
©((Ign)^"*""^) expected number of hops, where e > denotes a constant. We ex- 
tend Kleinberg's model and add two augmented local links for each node, which 
are connected to nodes chosen randomly and uniformly within Ig^ n Mahattan dis- 
tance. Our investigation shows that these augmented local connections can make 
small- world networks more navigable. 

We show that if each node is aware of O(lgn) number of neighbors via the 
augmented local links, there exist both non-oblivious and oblivious algorithms that 
can route messages between any pair of nodes in O(lgnlglgn) expected number 
of hops, which is a near optimal routing complexity and outperforms the other 
related results for routing in Kleinberg's small-world networks. Our schemes keep 
only 0(lg^ n) bits of routing information on each node, thus they are scalable with 
the network size. Our results imply that the awareness of O(lgn) nodes through 
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augmented links is more efHcient for routing than via the local links [10, 17]. 

Besides adding new light to the studies of social networks, our results may also 
find applications in the design of large-scale distributed networks, such as peer-to- 
peer systems, in the same spirit of Symphony [15]. 

1 Introduction 

A well-known study by Milgram in 1967 [18] shows the small-world phenomenon [9], also 
called "six degree of separation" , that any two people in the world can be connected by a 
chain of six (on the average) acquaintances, and people can deliver messages efficiently to 
an unknown target via their acquaintances. This study is repeated by Dodds, Muhamad, 
and Watts [8] recently, and the results show that it is still true for today's social network. 
The small- world phenomenon has also been shown to be pervasive in networks from nature 
and engineering systems, such as the World Wide Web [21, 1], peer-to-peer systems [2, 
16, 15, 22], etc. 

Recently, a number of network models have been proposed to study the small-world 
properties [19, 21, 13]. Watts and Strogatz [21] propose a random rewiring model whose 
diameter is a poly-logarithmic function of the size of the network. The model is con- 
structed by adding a small number of random edges to nodes uniformly distributed on a 
ring, where nodes are connected densely with their near neighbors. A similar approach 
can also be found in Ballabas and Chung's earlier work [6], where the poly- logarithmic 
diameter of the random graph is achieved by adding a random matching to the nodes 
of a cycle. However, these models fail to capture the algorithmic aspects of a small- 
world network [13]. As commented by Kleinberg in [13], the poly-logarithmic diameter 
of some graphs does not imply the existence of efficient routing algorithms. For example, 
the random graph in [6] yields a logarithmic diameter, yet any routing using only local 
information requires at least y/n expected number of hops (where n is the size of the 
network) [13]. 

In order to incorporate routing or navigating properties into random graph mod- 
els, Kleinberg [13] develops a new model based on a li-dimensional torus lattice with 
long-range links chosen randomly from the rf-harmonic distribution, i.e., a long-range 
link between nodes u and v exists with probability proportional to Dist{u,v)~'^, where 
Dist{u,v) denotes the Mahattan distance between nodes u and v. Based on this model, 
Kleinberg then shows that routing messages between any two nodes can be achieved in 
0(lg^ n) ^ expected number of hops by applying a simple greedy routing algorithm using 
only local information. This bound is tightened to ©(Ig^ n) later by Barriere et al. [3] 
and Martel et al. [17]. Further research [16, 14, 17, 10] shows that in fact the O(lg^n) 
bound of the original greedy routing algorithm can be improved by putting some extra 
information in each message holder. Manku, Naor, and Wieder [16] show that if each 

^The logarithmic symbol Ig is with the base 2, if not otherwise specified. Also, we remove the ceiling 
or floor for simplicity throughout the paper. 
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message holder at a routing step takes its own neighbors' neighbors into account for mak- 
ing routing decisions, the bound of routing complexity can be improved to where 
q denotes the number of long-range contacts for each node. Lebhar and Schabanel [14] 
propose a routing algorithm for 1-dimensional Kleinberg's model, which visits O ( jg'^^+g) ) 
nodes on expectation before routing the message, and they show that a routing path 
with expected length of can be found. Two research groups, Fraigniaud et 

al. [10], and Martel and Nguyen [17], independently report that if each node is aware of 
its O(lgn) closest local neighbors, the routing complexity in d-dimensional Kleinberg's 
small- world networks can be improved to 0{\gn\g^'^^^''' n) expected number of hops. The 
difference is that [17] requires keeping additional state information, while [10] uses an 
oblivious greedy routing algorithm. Fraigniaud et al. [10] also show that ©(Ig^n) bits 
of topological awareness per node is optimal for their oblivious routing scheme. In [17], 
Martel and Nguyen show that the expected diameter of a rf- dimensional Kleinberg net- 
work is 0(lgn). As such, there is still some room for reducing the routing complexity, 
which motivates our work. 

Other small- world models have also been studied. In their recent paper [20], Nguyen 
and Martel study the diameters of variants of Kleinberg's small-world models, and pro- 
vide a general framework for constructing classes of small- world networks with ©(Ign) 
expected diameter. Aspnes, Diamadi, and Shah [2] find that the greedy routing algo- 
rithms in directed rings with a constant number of random extra links given in any 
distribution requires at least f2(lg^ n/ Iglgn) expected number of hops. Another related 
models are the small- world percolation models [16, 4, 7, 5]. The diameters of these mod- 
els are studied by Benjamin et al. [4], Coppersmith et al. [7] and Biskup [5]. The routing 
aspects of the percolation models, such as the lower bound and upper bound of greedy 
routing algorithms with 1-lookahead, are studied in [16]. 

Applications of small- world phenomenon in computer networks include efficient lookup 
in peer-to-peer systems [16, 2, 15, 22], gossip protocol in a communication network [12], 
flooding routing in ad- hoc networks [11], and the study of diameter of World Wide 
Web [1], etc. 

1.1 Our Contributions 

We extend Kleinberg's structures of small-world models with slight change. Besides hav- 
ing long-range and local links on the grid lattice, each node is augmented with two extra 
links connected to nodes chosen randomly and uniformly within Ig^ n Mahattan distance. 
Based on this extended model, we present near optimal algorithms for decentralized rout- 
ing with O(lgn) augmented awareness. We show that if each node is aware of O(lgn) 
number of nodes via the augmented neighborhood, there exist both non-oblivious and 
oblivious routing algorithms that perform in O(lgnlglgn) expected number of hops (see 
Theorem 1 and Theorem 2). Our investigation constructively show that the augmented 
local connections can make small-world networks more navigable. 
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Table 1: Comparisons of our decentralized routing algorithms with the other existing schemes. In the first three schemes 
(in [13, 2, 15, 16, 14]), we suppose that each node has q long-range contacts, while in the next three schemes (in [17, 10] 
and this paper), we suppose that each node has one long-range contact. A routing protocol is oblivious if the message 
holder makes routing decisions only by its local information and the target node, and independently of the previous routing 
history, otherwise, it is said to be non-oblivious. 

A comparison of our algorithm with the other existing schemes is shown in Table 1. 
Our decentralized routing algorithms assume that each node can compute a shortest path 
among a poly-logarithmic number of known nodes. Such an assumption is reasonable 
since each node in a computer network is normally a processor and can carry out such 
a simple computation. Our schemes keep O(lg^n) bits of routing information stored on 
each node, thus they are scalable with the increase of network size. Our investigation 
shows that the awareness of O(lgn) nodes through the augmented links is more efficient 
for routing than via the local links [10, 17]. 

We note that besides adding new light to the studies of social networks such as Mil- 
gram's experiment [18], our results may also find applications in the design of large-scale 
distributed networks, such as peer-to-peer systems, in the same spirit of Symphony [15]. 
Since the links in our extended model are randomly constructed according to the proba- 
bilistic distribution, the network may be less vulnerable to adversarial attacks, and thus 
provide good fault tolerance. 

1.2 Organization 

The rest of the paper is organized as follows. Section 2 gives notations for Kleinberg's 
small-world model and its extended version with augmented local connections. Section 3 
gives some preliminary notations for decentralized routing. In Section 4, wc propose both 
non-oblivious and oblivious routing algorithms with near optimal routing complexity in 
our extended model. Section 5 gives the experimental evaluation of our schemes. Section 6 
briefly concludes the paper. 

2 Definitions of Small- World Models 

In this section, we will give the definition of Kleinberg's small-world model and its ex- 
tended version in which each node has extra links. For simplicity, we only consider the 
one-dimensional model with one long-range contact for each node. In addition, we as- 
sume that all links are directed, which is consistent with the real-world observation, for 
example, person x knows person y, but y may not know x. 
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Definition 1 (Kleinberg's Small- World Network (KSWN) [13]) A Kleinberg's 
Small-World Network, denoted as K, is based on a one- dimensional torus (or ring) [n] = 

[0, 1, • • ■, n]. Each node u has a directed local link to its next neighbor {u + 1) mod n on 
the ring. We refer to this local link as Ring-link (or R-link for short), and refer to node 
(■u + 1) mod n as the R-neighbor of node u. In addition, each node has one long-range 
link to another node chosen randomly according to the l-harm,onic distribution, that is, 
the probability that node u sends a long-range link to node v is Pr[n v] = ^ .Dist(uv) ' 
where Dist{u,v) denotes the ring distance ^ from u to v, and Zy^ = Y2zj^u Dist{u z) ' 

We 

refer to this long-range link as the Kleinberg-link (or K-link for short), and refer to 
node V as a K-neighbor of node u if a K-link exists from u to v. 

Our extended structure introduces several extra links for each node. Its definition is 
given below. 

Definition 2 (KSWN with Augmented Local Connections (KSWN*)) A Klein- 
berg's Small-World Network with Augmented Local Connections, denoted as K* , has the 
same structure of KSWN, except that each node u in fC* has two extra links to nodes 
chosen randomly and uniformly from the interval {u, u + Ig^ n] . We refer to these two 
links as the augmented local links (or AL-links for short), and refer to node v as a 
AL-neighbor of node u if a AL-link exists from u to v. 

There are in total four links for each node in a KSWN*: one R-link, one K-link, two 
AL-hnks. We refer to all nodes linked directly by node u as the immediate neighbors 
of u. Our extended structure retains the same 0{1) order of node degree as that of 
Kleinberg's original model. 

3 Decentralized Routing Algorithms 

Based on the original model, Kleinberg presents a class of decentralized routing algo- 
rithms, in which each node makes routing decisions by using local information and in 
a greedy fashion. In other words, the message holder forward the message to its imme- 
diate neighboring node, including its K-neighbor, which is closest to the destination in 
terms of the Mahattan distance. Kleinberg shows that such a simple greedy algorithm 
performs in O(lg^n) expected number of hops. The other existing decentralized routing 
algorithms [2, 15, 14, 10, 17, 16] mainly rely on three approaches to improve routing 
performance: (1) Increasing the number of long-range links [2, 15]; (2) Exploring more 
nodes before making routing decisions [14]; (3) Increasing the local awareness for each 
node [10, 17]. However, so far using these approaches can only achieve 0((lgn)^+'') ex- 
pected number of hops in routing, where e > 0. Although the scheme in [16], where each 
node makes routing decision by looking ahead its neighbors's neighbors, can achieve an 
optimal 0(lgn/lglgn) bound, their result depends on the fact that each node has at 
least fl{\gn) number of K- links. 

^or Mahattan distance for multi-dimensional models. 
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There are normally two approaches for decentralized routing: oblivious and non- 
oblivious schemes [10]. A routing protocol is oblivious if the message holder makes 
routing decisions only by its local information and the target node, and independently of 
the previous routing history. On the other hand, if the message holder needs to consider 
certain information of the previous routing history to make routing decisions, the protocol 
is referred to as non-oblivious. The non-oblivious protocol is often implemented by adding 
a header segment to the message packet so that the downstream nodes can learn the 
routing decisions of upstream nodes by reading the message header information. The 
scheme in [10] is oblivious, while the schemes in [14] and [17] are non-oblivious. 

We refer to the message holder as the current node. For the current node x, we define 
a sequence of node sets Tq.Ti,- ■ -.Ti,- ■ ■, where Tq = {x}, Ti = { -u's AL-neighbors, 
Vm G To}, T2 = {-u's AL-ncighbors, Vm G Ti}, and so on. We refer to Tj as the set of 
nodes in the ith level of AL neighborhood, and let Hi = lJ^<,;Tj denote the set of all nodes 
in the first i levels of AL neighborhood. At a certain level i of AL neighborhood, we may 
also refer to ifj-i as the set of previously known nodes. Let Li — — denote the 
set of new nodes discovered during the ith level of AL neighborhood. Let Ax{k) — Hk 
denote the augmented local awareness (or AL awareness for short) of a given node in a 
KSWN*, where each node is aware of the first k levels of its AL neighborhood. 

In Section 4, we will show that there exists a sufficiently large constant a such that 
|A2:(lglgn)| > Ign/o", based on which we propose both non-oblivious and oblivious rout- 
ing algorithms running in O(lgnlglgn) expected number of hops and requiring O(lg^n) 
bits of information on each node. 

Our near optimal O(lgnlglgn) bound on the routing complexity outperforms the 
other related results for Kleinberg's small-world networks. To our knowledge, our algo- 
rithms achieve the best expected routing complexity while requiring at most 0(log^ n) 
bits of information stored on each node. 

4 Near Optimal Routing with O(lgn) Awareness 
4.1 Augmented Local Awareness of O(lgn) 

In this subsection, we will show that |Aa;(lglgn)|, the number of distinct nodes that node 
X is aware of via the first Iglgn levels of AL neighborhood, is not less than Ign/a for a 
constant a, which, as will be shown in Lemma 3, is sufficiently large to guarantee that 
Ax{\g\gn) contains a K-link that jumps over half distance (Suppose that the destination 
node is at a certain large distance from the current node). These results are useful for 
the subsequent analysis of our oblivious and non-oblivious routing schemes. 

Lemma 1 Let Ax{\g\gn) denote the AL awareness of node x in a KSWN* JC* , where 
each node is aware of Iglgn levels of AL-neighbors. Then 




where a denotes a sufficiently large constant and ip denotes a positive constant. 
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Proof: Throughout the proof, we assume that \Hi\ < for all 1 < i < Iglgn, 
otherwise, the lemma already holds, since |Aa;(lglgn)| = |i/igign| > Ign/a. We will 
show that at each level of AL neighborhood, the probability that each AL-link points to 
previously known nodes is small so that a large number of distinct nodes will be found 
via the first Ig Ig n levels of AL neighborhood. 

Consider the construction of a AL-link for the current node x. By definition of 
KSWN*, each AL-hnk of x is connected to a node randomly and uniformly chosen from 
the interval {x,x + Ig^n], that is, each AL-link of x points to a node in the interval 
(x, x+lg^ n] with probability (lgn)~^. By assumption, there could be no more than Ign/a 
previously known nodes in the interval {x, a;-|-lg^ n]. Thus, the probability for a AL-link of 
a given node to point to a previously known node is at most (lgn/a)-(lgn)^^ = (crlgn)^^. 
Thus, the probability for a AL-link of x to point to a new node is at least 1 — {a\gn)~^. 
There are in total at most 2 • |ifigign| < 2lgn/a number of AL-links, so the probability 
for all AL-links to point to new nodes is at least (1 — (crlgn)"^)^'^"/'^ > 1 — ^ for 
sufficiently large n. Here we use the fact {l+xY > 1 + ax ior x > —1 and a >1. When 
(7 is a sufficiently large constant, we have Pr[ > ^ ] > for a positive constant 
i/j — 1 — > 0. Thus, the proof of Lemma 1 is completed. I 

4.2 Non-Oblivious Decentralized Routing 

Our non-oblivious routing algorithm is given as follows: Initially the source node s finds 
in its AL awareness As{\g\gn) an intermediate node z that is closest to the destination, 
and then computes a shortest path tt from s to z in As{\g\gn). Before routing the 
message, s adds the information about shortest path tt to the message header. Once 
the message passes a node on the shortest path tt, the next stop is read off the header 
stack. When the message reaches node z, node z can tell that it is an intermediate 
target by reading the message header and then route the message to its K-ncighbor. 
Such processes are repeated until the message reaches a certain node close enough to the 
destination node. After that, Kleinberg's plain greedy algorithm can be used to route 
the message effectively to the target node. Given a message M, a source node s and a 
target node t in a KSWN* /C*, the pseudocodes of our non-oblivious algorithm running 
on the current node x are given in Algorithm 1. 

Next we will analyze the performance of the Algorithm 1. We first give a basic lemma, 
which provide a lower bound and an upper bound on the probability of the existence of 
a K-link in Kleinberg's small-world networks. Its proof can be found in Appendix A. 

Lemma 2 Let Pt[u-^v] denote the probability that node u sends a K-link to node v in 
a KSWN* /C*. Suppose that a < Dist{u,v) < b, then < Pt[u^^v] < where Ci 
and C2 are constants independent of n. 

In Lemma 1, we have shown that Pr[ |Aj;(lglgn)| > \gn/a ] is at least a positive 
constant for a sufficiently large constant a. Based on this result. Lemma 3 shows that 
the probability for Axilglgn) to contain a K-link jumping over half distance is at least a 
positive constant. 
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Algorithm 1 



Input: the source s, the target t and the message M. 
Initialization: 

Current node ^ s. 

Set the header stack of the message M to be empty, 
while Distance between the current node and the destination > (Ign)^lglgn do 
if the header staclt of the message M is empty then 
Route the message M to a;'s K-neighbor y. 

Find an intermediate node z in Ay(lglgn) whose K-neighbor is closest to t (ties are broken arbitrarily). 
Compute a shortest path tt : xq = y,xi,---,xt = z from y to z, and push the shortest path information -k : xi,---,xt = z 
into the header stack of the message M. 
else 

Pop up the first node Xi from the header stack and route the message M to node xi. 
end if 
end while 

Final phase (Kleinberg's greedy algorithm): 

Route the message M to an immediate neighbor of x that is closest to the target t, until it reaches t. 



Lemma 3 Suppose that the distance between the current node x and the target node t 
in a KSWN* /C* is Dist(x,t) > Ig^nlglgn. Then with probability at least a positive 
constant, node x's AL awareness Ax{\g\gn) contains a K-neighbor within Dist{x,t)/2 
distance to the target node t . 

Proof: Let A denote the event that |74-r(lglgn)| > By Lemma 1, we have Pr[^] > 
ip for a constant ip > 

Let Bi{t) denote the set of all nodes within I ring distance to t. Let Pr[x-^Si(t)] 
denote the probability that x's K-neighbor is inside the ball Bi{t). 

Let m — Dist{x,t). By Lemma 2, the probability for a K-link to point to a given 
node inside the ball Bmit) is at least — r— , so we have 

2 ^ ' m Ig n ' 

Fr[x^B^{t)] > \B^{t)\ ■ = - ■ 4^ > 

2 2 mlgn 2 mlgn Ign 

where C3 is a constant. 

Since Dist(x,t) > Ig^nlglgn and each AL-link spans a distance no more than Ig^n, 
the nodes in AL awareness A^(lglgn) are all between the current node x and the target 
node t. Let Pr[74a;(lglgn) — ^Bm^it)] denote the probability that at least one node in 
Ax(lglgn) has a K-neighbor in Biq.{t). Then we have 

Pr[A,(lglgn)-^i?^(t)] > Pr[A,(lglgn)-^S™(t) | A] ■ Vi[A] 

>(1-(1-T^)^)-V^ 
^ Ign ' 

>V'(l-e-^), 

which is larger than a positive constant. At the last step, we obtain (1~ ^ 
by using the fact that (1 + < for 6 G M and a; > 0. I 

Lemma 4 Suppose that the distance between the current node x and the target node t 
in a KSWN* K,* is Dist(x,t) > Ig^nlglgn. Then after at most O(lgnlglgn) expected 
number of hops, Algorithm 1 will reduce the distance to within Ig^nlglgn. 
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Proof: Since Dist{x,t) > Ig^nlglgn, all known nodes in x's AL awareness Ax{lglgn) 
are between the current node x and the target node t. We can apply the result in Lemma 3 
to analyze Algorithm 1. 

We refer to the routing steps from a given node x to any node within ^^.(Iglgn) as 
an indirect phase. The routings in different indirect phases arc independent from each 
other. By Lemma 3, the probability that node x's AL awareness ^^.(Iglgn) contains 
a K-neighbor within Dist{x,t)/2 distance to the target node t is at least a positive 
constant, so after at most 0(1) expected number of indirect phases, Algorithm 1 will find 
an intermediate node whose K-link jumps over half distance. Since each indirect phase 
takes at most Iglgn hops and the maximum distance is n, after at most 0{\gn Iglgn) 
expected number of hops, the message will reach a node within Ig^ n Ig Ig n distance to 
the target node t. I 

Lemma 5 Suppose that the distance between the current node x and the target node t in 
a *KSWN *K, is Dist{x,t) < Ig^nlglgn. Then using the final phase of Algorithm 1 (i.e. 
using Kleinberg's greedy algorithm) can route the message to the target node t in 0{\gn) 
expected number of hops. 

Proof: When the distance Dist{x,t) < Ig^nlglgn, the final phase in Algorithm 1 is 
executed. By Kleinberg's results in [13], after at most 0(lg^(lg^ nlglgn)) = O(logn) 
expected number of steps, the message will be routed to the destination node. I 
Combining the above lemmas, it is not difficult for us to obtain the routing complexity 
of Algorithm 1. 

Theorem 1 In a KSWN* K,* , Algorithm 1 performs in 0(lgn \g\gn) expected number 
of hops. 

4.3 Oblivious Decentralized Routing 

In our oblivious scheme, when the distance is large, the current node x first finds in 
Ax{\g\gn) whether there is an intermediate node 2;, which contains a K-ncighbor within 
Dist{x,t)/2 distance to the target node, and is closest to node x in terms of AL-links 
(any possible tie is broken arbitrarily). Next, node x computes a shortest path tt from x 
to z among the AL awareness Ax(\.glgri), and then routes the message to its next AL- 
neighbor on the shortest path tt. When the distance is small, Kleinberg's plain greedy 
algorithm is applied. 

Given a message M, a source s and a target t in a KSWN* /C*, the pseudocodes of 

our oblivious algorithm running on the current node x are given in Algorithm 2. 
Lemma 6 Suppose that the distance between the current node x and the target node t in 

a KSWN* fC* is Dist{x,t) > c(lgn)^ Iglgn, where c is a sufficiently large constant. Then 

after at most O(lglgn) expected number of hops, Algorithm 2 will reduce the distance to 

within Dist{x,t)/2. 
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Algorithm 2 

Input: the source s, the target t and the message M. 
Initialization: 
Current node <— s. 

while Distance between the current node and the destination > c(lgn)^ lglg»^ do (c is a sufficiently large constant and 
will be given later) 

z *— a. node in Ax{lglgn) that contains a K-neighbor within Dist{x,t)/2 distance to t, and is closest to node x in terms 
of AL-links (ties are broken arbitrarily), 
if node z does not exist then 

Route the message M to an immediate neighbor closest to node t. 
else 

Compute a shortest path n from x to z among Aa;(lglgn). 
if IT consists of only node x itself then 

Route the message M to the K-neighbor. 
else 

Route the message M to the next AL-neighbor on the shortest path it. 
end if 
end if 
end while 

Final phase (Kleinberg's greedy algorithm): 

Route the message M to an immediate neighbor of x that is closest to the target t, until it reaches t. 




Figure 1: Dia gram for oblivious decentralized routing. The shade area represents node x's AL awareness Ax{lg\gn). 
The target node t is on the right side of x. Node r is the midpoint of xt. Node r' is between nodes r and t such that 
Dist{r,r') = Ig^ n Iglgn. Node z is an intermediate node in Ax(lglgn) that contains a K-neighbor in rt (in rr' or r't ) 
and is closest to x in terms of AL-links. 

Proof: As sliown in Figure 1, node r is the midpoint of xt, and node r' is between r and 
t such that Dist{r,r') = Ig^nlglgn. Let z be an intermediate node in ^^.(Iglgn) that 
contains a K-neighbor between r and t, and is closest to x in terms of AL-links. We refer 
to a node z in x's AL awareness Ax{\glgn) as a good intermediate node if it satisfies the 
following two conditions: (1) has a K-neighbor within Dist{x, t)/2 to the target node; (2) 
is closest to node x in terms of AL-links. Let tt : xo = x,xi, ■ ■ ■,Xt = z denote a shortest 
path that x finds from itself to z among the AL awareness ^^..(Iglgn). We divide the 
nex{jf(^ip^^i^r^ptea^^o^j^(K_^gK^i1^^4if^ 9!s!^Kr^te- x and 

the right most node in ^^.(Iglgn) is at most Ig^nlglgn, z's K-neighbor is also within 
Dist{xi, t)/2 to the target node for every Xi on the shortest path vr, that is, node z always 
satisfies the first condition of a good intermediate node for every node Xj. Also, if z is 
an intermediate node closest to x, it is also a closest intermediate node to every Xi on 
the shortest path vr, that is, z also satisfies the second condition of a good intermediate 
node for every node Xj. Therefore, node z will become a fixed good intermediate node 
for all nodes Xj on the shortest path. When this case happens. Algorithm 2 will route the 
message along a shortest path from x to z in an oblivious routing fashion. Thus, in this 
case, after at most Iglgri number of hops, the message will reach a good intermediate 
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node and the routing distance will be reduced by half ^. In the second case, z's K- 
neighbor is within rr' . When this happens, the intermediate node z may change for each 
Xi on the shortest path and the message may not be routed 

along the shortest path as expected by the previous node x. However, we will show that 
the latter case will not happen very likely, since the length of rr' is relatively small when 
Dist{x,t) > c(lgn)^lglgn for a sufficiently large constant c. 

Let J^i denote the event that Ax{lglgn) contains a K- neighbor in r't. By using a 
similar technique in Lemma 3, we can easily obtain that occurs with probability at 
least a positive constant. 

Let J^2 denote the event that Aj.{\g\gn) contains a K- neighbor in rr'. For any node y 
in yl^(lglgn), we have Dist{y, r) > |c(lgn)^ Iglg'^ when c is a sufficiently large constant. 
By Lemma 2, the probability for a node y in ^^^(Iglgn) to send a K-link to a node in rr' 
is at most — tttt^t — tt — • Because there arc in total Ig^nlglgn nodes in rr', a node in 
A^ilg Ig n) has a K- neighbor in rr' with probability at most c(ig„)2(]g]g„).ig^ • Ig^ n Ig Ig n = 
Since |A^(lglgn)| < I + 2 + 2^^ + ■ ■ ■ + 2ieign < 21gn, the event J^2, i.e., A^(lglgn) 
has a K- neighbor in rr', occurs with probability at most ■ 21gn = which is 
smaller than a certain constant when c is a sufficiently large constant. Thus, we have 
Pr[-'J^2] > 7 for a constant 7 > 0, if we choose a sufficiently large constant c. 

Therefore, Pr[-i^2 D-^i] larger than a positive constant, if we choose a sufficiently 
large constant c. Thus, after at most c'lglgn expected number of hops for a constant 
c', the event -'J-^2f]^i will occur, that is, a message will be routed to a node x whose 
AL awareness Ax{lglgn) contains a K- neighbor in r't, but no K- neighbor in rr'. When 
such a node x is reached, the intermediate node z is fixed for every node Xi on a shortest 
path TT : xo — x,xi, • • -jXt — z in an oblivious routing fashion. Then after at most Iglgn 
number of hops, the message will be routed to the fixed intermediate node z, which has 
a K-link jumping over half distance. 

Therefore, after at most c'lglgn + Iglgn = O(lglgn) expected number of hops, the 
distance will be reduced by half. I 

Lemma 7 Suppose that the distance between the current node x and the target node t in 
a KSWN* IC* is Dist{x,t) > clg^nlglgn, where c is a sufficiently large constant. Then 
after at most O(lgnlglgn) expected number of hops, Algorithm 2 will reduce the distance 
to within c Ig^ n\g\gn. 

Proof: The proof is similar to that of Lemma 4, and hence is omitted here. I 

Lemma 8 Suppose that the distance between current node x and the target node t in a 
KSWN* IC* is m < c(lgn)^ Ig Ign, where c is a sufficiently large constant. Then using 
the final phase of Algorithm 2 (i.e. using Kleinberg's greedy algorithm) can route the 
message to the target node t in O(lgn) expected number of hops. 

•^There may be more than one good intermediate nodes z when a tie happens. However, even when 
this happens, the message will still reach one of good intermediate nodes along a shortest path finally. 

Hereinafter, we focus on the case in which the good intermediate node z is unique for the current node 
X. The analysis for the case with multiple good intermediate nodes can be easily obtained. 
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Proof: The proof is similar to that of Lemma 5, and hence is omitted here. I 
Combining the above lemmas, we can easily obtain the following theorem. 

Theorem 2 In a KSWN* K,* , Algorithm 2 performs in 0{\gn Iglgn) expected number 
of hops. 

5 Experimental Evaluation 

In this section, we will conduct experiments to evaluate our schemes and other existing 
routing schemes for Kleinberg's small-world networks. 

We focus on the following four schemes: (a) The original greedy routing algorithm [13] 
in Kleinberg's small-world network with only one long-range contact per node. Each node 
forwards the message to its immediate neighbor closest to the destination; (b) The greedy 
routing algorithm in Kleiberg's small-world network with two long-range contacts per 
node [2, 15]. In the experimental study, we would like to learn how much the additional 
number of long-range links can help routing, (c) The decentralized routing scheme with 
O(lgn) local awareness [10, 17]. With this scheme, we intend to evaluate the degree at 
which the local awareness improve the routing efficiency, (d) Our near optimal routing 
scheme proposed in this paper. We note that most schemes have both non-oblivious and 
oblivious versions. Here we only focus on the non-obhvious version for each scheme. 

5.1 Experimental Setup 

Network Construction: We construct the small- world network based on a ring [0, 1, ■ • 
•,n]. Each node i is connected to its immediate neighbors {i + 1) mod n. Let if„ — 
Sr=i ^1'^ denote the harmonic normalization factor. We then generate a sequence of 
intervals which we call the probabihty intervals, where 1 <i <n — l. Let < /i < 
1/Hn, and ^j^_l-^fj < h ^ j^, where 2 < i < n — 1. Each node i uniformly generates 
a random number x in (0, 1], and then finds the interval that contains x. Suppose that 
X is located in the interval Ik- Node i then forms a long-range link connected to a node 
with the distance k. When each node has multiple long-range contacts, it just generates 
more than one random numbers, and sets up the connections in the same way. 

In the extension of Kleinberg's small-world networks, each node uniformly and ran- 
domly chooses two nodes within the Manhattan distance Ig^ n as its augmented local 
neighbors. 

Messages Generation and Evaluation Metrics: We let each node generate a query 
message with a random destination, and then evaluate the following metrics. 

(1) Average length of routing path is the average number of hops travelled by the 
messages from the source to the destination. 

(2) Storage requirement for each node is the number of information bits required to 
be stored on each node. 
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5.2 Experimental Results 



We vary the number of nodes in the network from 5,000 to 25,000, and evaluate different 
routing schemes, as shown in Figures 2 and 3. For large n, the greedy algorithm with 
increasing number of long-range contacts [2, 15], the decentralized routing algorithm with 
local awareness [10, 17], our near optimal and algorithm all improve Kleinberg's original 
greedy algorithm. Our near optimal scheme can find a shorter routing path than the 
decentralized routing schemes with local awareness [10, 17], while keep almost the same 
storage space on each node. 



- Greedy wilh 1 long-range 

* Greedy wilh 2 long-range 

- O- - Algorithm with log n local 
ar optimal algorithr 




Figure 2: Average length of routing paths for different routing schemes. 



^^^1 Greedy with 1 long-range 
^^^1 Greedy with 2 long-range 
I [ Algonthm with log n local 
200 - I I Our near optimal algorithi 



Figure 3: Storage requirement on each node for different routing schemes. 
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6 Conclusion 



We extend Kleinberg's small-world network with augmented local links, and show that if 
each node participating in routing is aware of O(lgn) neighbors via augmented links, there 
exist both non-oblivious and oblivious decentralized algorithms that can finish routing in 
O(lgnlglgn) expected number of hops, which is a near optimal routing complexity. Our 
investigation shows that the awareness of O(lgn) nodes through the augmented links will 
be more efficient for routing than via the local links [10, 17]. 

Our extended model may provide an important supplement for the modelling of small- 
world phenomenon, and may better approximate the real-world observation. For example, 
each person in a human society is very likely to increase his/her activities randomly 
within some certain communities, and thus is aware of certain levels of "augmented" 
acquaintances. This augmented awareness would surely help delivery the message to an 
unknown target in the society. 

Our results may also find applications in the design of large-scale distributed networks, 
such as distributed storage systems. Unlike most existing deterministic frameworks for 
distributed systems, our extended small-world networks may provide good fault tolerance, 
since the links in the networks are constructed probabilistically and less vulnerable to 
adversarial attacks. 
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Appendix A. Proof of Lemma 2 



Lemma 2. Let Pt[u^^v] denote the probability that node u sends a K-link to node v 
in a KSWN* /C*. Suppose that a < Dist{u,v) < b, then < Pr[u^^v] < where 
ci and C2 are constants independent of n. 

Proof: The probability that node f is a K-neighbor of node u is Pr[M — >t>] = ^^^^ , 
where Dist{u,v) is the ring distance between nodes u and v, and = J2z^v Dist{v z) • 
Observe that Zy = ^^^i where \Ui\ is the set of all nodes at distance i away to 



node V. Since \Ui\ = 0(1), we have Zy = Y^^=i = 6(lgn). 

Since a < Dist(u,v) < b, we have -rr— < Prk-^vl < -f^- for some constants Ci 

— \ ' / — ' feign L i algn' ^ 

and C2 independent of n. Thus the lemma follows. I 
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